Dataset statistics
| Number of variables | 22 |
|---|---|
| Number of observations | 3096 |
| Missing cells | 11152 |
| Missing cells (%) | 16.4% |
| Duplicate rows | 0 |
| Duplicate rows (%) | 0.0% |
| Total size in memory | 532.3 KiB |
| Average record size in memory | 176.0 B |
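The derived figures in this table follow from the raw counts. The sketch below recomputes the missing-cell share and the average record size from the values above. It assumes 1 KiB = 1024 B, and the variable names are illustrative rather than part of the report.

```python
# Values copied from the "Dataset statistics" table.
n_variables = 22
n_observations = 3096
missing_cells = 11152
total_size_kib = 532.3

# Share of missing cells over all cells (rows x columns).
total_cells = n_variables * n_observations
missing_pct = 100 * missing_cells / total_cells

# Average record size in bytes, assuming 1 KiB = 1024 B.
avg_record_bytes = total_size_kib * 1024 / n_observations

print(f"Total cells: {total_cells}")
print(f"Missing cells: {missing_pct:.1f}%")
print(f"Average record size: about {avg_record_bytes:.0f} B")
```

The recomputed record size agrees with the reported 176.0 B only to within rounding, since the total size is itself a rounded figure.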
Variable types
| Numeric | 11 |
|---|---|
| Categorical | 11 |
Alerts
| Alert | Type |
|---|---|
| State Name has constant value "UTTARAKHAND" | Constant |
| District Name is highly overall correlated with Water_Body_Nature and 2 other fields | High correlation |
| Original_Storage_Capacity is highly overall correlated with Present_Storage_Capacity and 1 other field | High correlation |
| Present_Storage_Capacity is highly overall correlated with Original_Storage_Capacity and 1 other field | High correlation |
| Reason_for_Water_Body_Use is highly overall correlated with Water_Body_Nature and 2 other fields | High correlation |
| Renovation_Year is highly overall correlated with construcion_year | High correlation |
| Water_Body_Nature is highly overall correlated with District Name and 2 other fields | High correlation |
| Water_body_in_use is highly overall correlated with District Name and 4 other fields | High correlation |
| construcion_year is highly overall correlated with Renovation_Year | High correlation |
| construction_cost is highly overall correlated with renovation_cost | High correlation |
| df_index is highly overall correlated with level_0 | High correlation |
| filled_up_storage_name is highly overall correlated with Water_body_in_use and 1 other field | High correlation |
| filled_up_storage_space_name is highly overall correlated with Water_body_in_use and 1 other field | High correlation |
| level_0 is highly overall correlated with df_index | High correlation |
| no_people_benefited_by_water_body is highly overall correlated with Reason_for_Water_Body_Use and 1 other field | High correlation |
| population_density_benefited is highly overall correlated with renovation_cost | High correlation |
| renovation_cost is highly overall correlated with District Name and 2 other fields | High correlation |
| storage_capacity_change is highly overall correlated with Original_Storage_Capacity and 2 other fields | High correlation |
| Area_Type is highly imbalanced (75.5%) | Imbalance |
| Water_Body_Type is highly imbalanced (65.0%) | Imbalance |
| Repair_Renovation_Status is highly imbalanced (97.1%) | Imbalance |
| construcion_year has 1654 (53.4%) missing values | Missing |
| construction_cost has 1654 (53.4%) missing values | Missing |
| Renovation_Year has 2789 (90.1%) missing values | Missing |
| renovation_cost has 2789 (90.1%) missing values | Missing |
| filled_up_storage_name has 46 (1.5%) missing values | Missing |
| filled_up_storage_space_name has 46 (1.5%) missing values | Missing |
| reason_water_body_in_use_name2 has 2174 (70.2%) missing values | Missing |
| construction_cost is highly skewed (γ1 = 29.2937204) | Skewed |
| Original_Storage_Capacity is highly skewed (γ1 = 55.58769145) | Skewed |
| Present_Storage_Capacity is highly skewed (γ1 = 55.59527059) | Skewed |
| no_people_benefited_by_water_body is highly skewed (γ1 = 22.83601348) | Skewed |
| storage_capacity_change is highly skewed (γ1 = -55.52042306) | Skewed |
| population_density_benefited is highly skewed (γ1 = 55.092668) | Skewed |
| level_0 is uniformly distributed | Uniform |
| df_index is uniformly distributed | Uniform |
| level_0 has unique values | Unique |
| df_index has unique values | Unique |
| Original_Storage_Capacity has 46 (1.5%) zeros | Zeros |
| Present_Storage_Capacity has 46 (1.5%) zeros | Zeros |
| storage_capacity_change has 518 (16.7%) zeros | Zeros |
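The missing-value percentages in the alerts can be checked against the 3096 observations from the dataset statistics. This is a minimal sketch: the counts are copied from the alerts, and the helper name `missing_pct` is hypothetical.

```python
n_observations = 3096  # from the dataset statistics

# Missing-value counts as listed in the "Missing" alerts.
missing_counts = {
    "construcion_year": 1654,
    "construction_cost": 1654,
    "Renovation_Year": 2789,
    "renovation_cost": 2789,
    "filled_up_storage_name": 46,
    "filled_up_storage_space_name": 46,
    "reason_water_body_in_use_name2": 2174,
}


def missing_pct(count: int, total: int) -> float:
    """Percentage of missing values, rounded to one decimal place."""
    return round(100 * count / total, 1)


missing_report = {
    name: missing_pct(count, n_observations)
    for name, count in missing_counts.items()
}

for name, pct in missing_report.items():
    print(f"{name}: {pct}% missing")
```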
Reproduction
| Analysis started | 2023-12-09 13:04:39.927157 |
|---|---|
| Analysis finished | 2023-12-09 13:04:52.810204 |
| Duration | 12.88 seconds |
| Software version | ydata-profiling v4.6.2 |
level_0
Real number (ℝ)
HIGH CORRELATION  UNIFORM  UNIQUE 
| Distinct | 3096 |
|---|---|
| Distinct (%) | 100.0% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 1547.5 |
| Minimum | 0 |
| Maximum | 3095 |
| Zeros | 1 |
| Zeros (%) | < 0.1% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 24.3 KiB |
Quantile statistics
| Minimum | 0 |
|---|---|
| 5-th percentile | 154.75 |
| Q1 | 773.75 |
| median | 1547.5 |
| Q3 | 2321.25 |
| 95-th percentile | 2940.25 |
| Maximum | 3095 |
| Range | 3095 |
| Interquartile range (IQR) | 1547.5 |
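These quantiles match linear interpolation over the integers 0 to 3095. The sketch below assumes that `level_0` holds exactly those integers; the report shows only the minimum, maximum, distinct count and monotonicity, so this is an inference. It uses the standard-library `statistics.quantiles` with `method="inclusive"`.

```python
import statistics

# Assumed contents of level_0: the integers 0, 1, ..., 3095.
values = list(range(3096))

# method="inclusive" interpolates linearly between order statistics.
quartiles = statistics.quantiles(values, n=4, method="inclusive")
ventiles = statistics.quantiles(values, n=20, method="inclusive")

q1, median, q3 = quartiles
p5, p95 = ventiles[0], ventiles[-1]
iqr = q3 - q1
value_range = max(values) - min(values)

print(p5, q1, median, q3, p95, iqr, value_range)
```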
Descriptive statistics
| Standard deviation | 893.88254 |
|---|---|
| Coefficient of variation (CV) | 0.57763008 |
| Kurtosis | -1.2 |
| Mean | 1547.5 |
| Median Absolute Deviation (MAD) | 774 |
| Skewness | 0 |
| Sum | 4791060 |
| Variance | 799026 |
| Monotonicity | Strictly increasing |
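The descriptive statistics are what a plain row index would produce. As a rough check, the sketch below recomputes them from the integers 0 to 3095, which is an assumption about the column's contents. The variance uses the sample (n - 1) denominator, and that reproduces the reported 799026.

```python
import math
import statistics

# Assumed contents of level_0: the integers 0, 1, ..., 3095.
values = list(range(3096))

mean = statistics.mean(values)
variance = statistics.variance(values)  # sample variance, n - 1 denominator
std = math.sqrt(variance)
cv = std / mean                         # coefficient of variation
median = statistics.median(values)
# Median absolute deviation around the median.
mad = statistics.median(abs(v - median) for v in values)
total = sum(values)

print(mean, variance, round(std, 5), round(cv, 8), mad, total)
```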
| Value | Count | Frequency (%) |
| 0 | 1 | < 0.1% |
| 2032 | 1 | < 0.1% |
| 2058 | 1 | < 0.1% |
| 2059 | 1 | < 0.1% |
| 2060 | 1 | < 0.1% |
| 2061 | 1 | < 0.1% |
| 2062 | 1 | < 0.1% |
| 2063 | 1 | < 0.1% |
| 2064 | 1 | < 0.1% |
| 2065 | 1 | < 0.1% |
| Other values (3086) | 3086 | 99.7% |
| Value | Count | Frequency (%) |
| 0 | 1 | |
| 1 | 1 | |
| 2 | 1 | |
| 3 | 1 | |
| 4 | 1 | |
| 5 | 1 | |
| 6 | 1 | |
| 7 | 1 | |
| 8 | 1 | |
| 9 | 1 |
| Value | Count | Frequency (%) |
| 3095 | 1 | |
| 3094 | 1 | |
| 3093 | 1 | |
| 3092 | 1 | |
| 3091 | 1 | |
| 3090 | 1 | |
| 3089 | 1 | |
| 3088 | 1 | |
| 3087 | 1 | |
| 3086 | 1 |
df_index
Real number (ℝ)
HIGH CORRELATION  UNIFORM  UNIQUE 
| Distinct | 3096 |
|---|---|
| Distinct (%) | 100.0% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 1547.5 |
| Minimum | 0 |
| Maximum | 3095 |
| Zeros | 1 |
| Zeros (%) | < 0.1% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 24.3 KiB |
Quantile statistics
| Minimum | 0 |
|---|---|
| 5-th percentile | 154.75 |
| Q1 | 773.75 |
| median | 1547.5 |
| Q3 | 2321.25 |
| 95-th percentile | 2940.25 |
| Maximum | 3095 |
| Range | 3095 |
| Interquartile range (IQR) | 1547.5 |
Descriptive statistics
| Standard deviation | 893.88254 |
|---|---|
| Coefficient of variation (CV) | 0.57763008 |
| Kurtosis | -1.2 |
| Mean | 1547.5 |
| Median Absolute Deviation (MAD) | 774 |
| Skewness | 0 |
| Sum | 4791060 |
| Variance | 799026 |
| Monotonicity | Strictly increasing |
| Value | Count | Frequency (%) |
| 0 | 1 | < 0.1% |
| 2032 | 1 | < 0.1% |
| 2058 | 1 | < 0.1% |
| 2059 | 1 | < 0.1% |
| 2060 | 1 | < 0.1% |
| 2061 | 1 | < 0.1% |
| 2062 | 1 | < 0.1% |
| 2063 | 1 | < 0.1% |
| 2064 | 1 | < 0.1% |
| 2065 | 1 | < 0.1% |
| Other values (3086) | 3086 | 99.7% |
| Value | Count | Frequency (%) |
| 0 | 1 | |
| 1 | 1 | |
| 2 | 1 | |
| 3 | 1 | |
| 4 | 1 | |
| 5 | 1 | |
| 6 | 1 | |
| 7 | 1 | |
| 8 | 1 | |
| 9 | 1 |
| Value | Count | Frequency (%) |
| 3095 | 1 | |
| 3094 | 1 | |
| 3093 | 1 | |
| 3092 | 1 | |
| 3091 | 1 | |
| 3090 | 1 | |
| 3089 | 1 | |
| 3088 | 1 | |
| 3087 | 1 | |
| 3086 | 1 |
Area_Type
Categorical
IMBALANCE 
| Distinct | 2 |
|---|---|
| Distinct (%) | 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 24.3 KiB |
| Rural | 2970 |
|---|---|
| Urban | 126 |
Length
| Max length | 5 |
|---|---|
| Median length | 5 |
| Mean length | 5 |
| Min length | 5 |
Characters and Unicode
| Total characters | 15480 |
|---|---|
| Distinct characters | 8 |
| Distinct categories | 2 ? |
| Distinct scripts | 1 ? |
| Distinct blocks | 1 ? |
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | Rural |
|---|---|
| 2nd row | Rural |
| 3rd row | Rural |
| 4th row | Rural |
| 5th row | Rural |
Common Values
| Value | Count | Frequency (%) |
| Rural | 2970 | 95.9% |
| Urban | 126 | 4.1% |
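The imbalance percentages in this report (75.5% for Area_Type, 65.0% for Water_Body_Type, 97.1% for Repair_Renovation_Status) can be reproduced from the category counts. The sketch below assumes the score is one minus the Shannon entropy normalized by the log of the number of classes. That definition is inferred from the fact that it matches the reported figures, not taken from the report itself, and the function name is hypothetical.

```python
import math


def imbalance_score(counts):
    """Return 1 - H / log2(k), where H is the Shannon entropy of the
    class distribution and k is the number of classes (assumed definition)."""
    n_classes = len(counts)
    if n_classes < 2:
        return 0.0
    total = sum(counts)
    probs = [c / total for c in counts if c > 0]
    entropy = -sum(p * math.log2(p) for p in probs)
    return 1 - entropy / math.log2(n_classes)


# Category counts copied from the "Common Values" tables of this report.
area_type = imbalance_score([2970, 126])
water_body_type = imbalance_score([2514, 461, 48, 41, 27, 5])
repair_status = imbalance_score([3087, 9])

for name, score in [
    ("Area_Type", area_type),
    ("Water_Body_Type", water_body_type),
    ("Repair_Renovation_Status", repair_status),
]:
    print(f"{name}: {100 * score:.1f}%")
```

A score of 0 corresponds to perfectly balanced classes, and values approaching 1 mean that a single class dominates.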
Words
| Value | Count | Frequency (%) |
| rural | 2970 | |
| urban | 126 | 4.1% |
Most occurring characters
| Value | Count | Frequency (%) |
| r | 3096 | |
| a | 3096 | |
| R | 2970 | |
| u | 2970 | |
| l | 2970 | |
| U | 126 | 0.8% |
| b | 126 | 0.8% |
| n | 126 | 0.8% |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 12384 | |
| Uppercase Letter | 3096 | 20.0% |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| r | 3096 | |
| a | 3096 | |
| u | 2970 | |
| l | 2970 | |
| b | 126 | 1.0% |
| n | 126 | 1.0% |
Uppercase Letter
| Value | Count | Frequency (%) |
| R | 2970 | |
| U | 126 | 4.1% |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 15480 |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| r | 3096 | |
| a | 3096 | |
| R | 2970 | |
| u | 2970 | |
| l | 2970 | |
| U | 126 | 0.8% |
| b | 126 | 0.8% |
| n | 126 | 0.8% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 15480 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| r | 3096 | |
| a | 3096 | |
| R | 2970 | |
| u | 2970 | |
| l | 2970 | |
| U | 126 | 0.8% |
| b | 126 | 0.8% |
| n | 126 | 0.8% |
State Name
Categorical
CONSTANT 
| Distinct | 1 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 24.3 KiB |
| UTTARAKHAND | 3096 |
|---|---|
Length
| Max length | 11 |
|---|---|
| Median length | 11 |
| Mean length | 11 |
| Min length | 11 |
Characters and Unicode
| Total characters | 34056 |
|---|---|
| Distinct characters | 8 |
| Distinct categories | 1 ? |
| Distinct scripts | 1 ? |
| Distinct blocks | 1 ? |
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | UTTARAKHAND |
|---|---|
| 2nd row | UTTARAKHAND |
| 3rd row | UTTARAKHAND |
| 4th row | UTTARAKHAND |
| 5th row | UTTARAKHAND |
Common Values
| Value | Count | Frequency (%) |
| UTTARAKHAND | 3096 | 100.0% |
Words
| Value | Count | Frequency (%) |
| uttarakhand | 3096 |
Most occurring characters
| Value | Count | Frequency (%) |
| A | 9288 | |
| T | 6192 | |
| U | 3096 | 9.1% |
| R | 3096 | 9.1% |
| K | 3096 | 9.1% |
| H | 3096 | 9.1% |
| N | 3096 | 9.1% |
| D | 3096 | 9.1% |
Most occurring categories
| Value | Count | Frequency (%) |
| Uppercase Letter | 34056 |
Most frequent character per category
Uppercase Letter
| Value | Count | Frequency (%) |
| A | 9288 | |
| T | 6192 | |
| U | 3096 | 9.1% |
| R | 3096 | 9.1% |
| K | 3096 | 9.1% |
| H | 3096 | 9.1% |
| N | 3096 | 9.1% |
| D | 3096 | 9.1% |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 34056 |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| A | 9288 | |
| T | 6192 | |
| U | 3096 | 9.1% |
| R | 3096 | 9.1% |
| K | 3096 | 9.1% |
| H | 3096 | 9.1% |
| N | 3096 | 9.1% |
| D | 3096 | 9.1% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 34056 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| A | 9288 | |
| T | 6192 | |
| U | 3096 | 9.1% |
| R | 3096 | 9.1% |
| K | 3096 | 9.1% |
| H | 3096 | 9.1% |
| N | 3096 | 9.1% |
| D | 3096 | 9.1% |
District Name
Categorical
HIGH CORRELATION 
| Distinct | 13 |
|---|---|
| Distinct (%) | 0.4% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 24.3 KiB |
| HARIDWAR | 1065 |
|---|---|
| UDHAM SINGH NAGAR | 897 |
| DEHRADUN | 210 |
| ALMORA | 178 |
| CHAMPAWAT | 143 |
| Other values (8) | 603 |
Length
| Max length | 17 |
|---|---|
| Median length | 11 |
| Mean length | 10.51938 |
| Min length | 5 |
Characters and Unicode
| Total characters | 32568 |
|---|---|
| Distinct characters | 21 |
| Distinct categories | 2 ? |
| Distinct scripts | 2 ? |
| Distinct blocks | 1 ? |
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | UDHAM SINGH NAGAR |
|---|---|
| 2nd row | UDHAM SINGH NAGAR |
| 3rd row | UDHAM SINGH NAGAR |
| 4th row | UDHAM SINGH NAGAR |
| 5th row | HARIDWAR |
Common Values
| Value | Count | Frequency (%) |
| HARIDWAR | 1065 | 34.4% |
| UDHAM SINGH NAGAR | 897 | 29.0% |
| DEHRADUN | 210 | 6.8% |
| ALMORA | 178 | 5.7% |
| CHAMPAWAT | 143 | 4.6% |
| BAGESHWAR | 115 | 3.7% |
| PITHORGARH | 93 | 3.0% |
| PAURI | 80 | 2.6% |
| NANITAL | 79 | 2.6% |
| TEHRI | 75 | 2.4% |
| Other values (3) | 161 | 5.2% |
Words
| Value | Count | Frequency (%) |
| haridwar | 1065 | |
| udham | 897 | |
| singh | 897 | |
| nagar | 897 | |
| dehradun | 210 | 4.3% |
| almora | 178 | 3.6% |
| champawat | 143 | 2.9% |
| bageshwar | 115 | 2.4% |
| pithorgarh | 93 | 1.9% |
| pauri | 80 | 1.6% |
| Other values (5) | 315 | 6.4% |
Most occurring characters
| Value | Count | Frequency (%) |
| A | 6690 | |
| R | 4079 | |
| H | 3693 | |
| D | 2438 | 7.5% |
| I | 2394 | 7.4% |
| N | 2162 | 6.6% |
| G | 2058 | 6.3% |
| (space) | 1794 | 5.5% |
| W | 1323 | 4.1% |
| U | 1283 | 3.9% |
| Other values (11) | 4654 |
Most occurring categories
| Value | Count | Frequency (%) |
| Uppercase Letter | 30774 | |
| Space Separator | 1794 | 5.5% |
Most frequent character per category
Uppercase Letter
| Value | Count | Frequency (%) |
| A | 6690 | |
| R | 4079 | |
| H | 3693 | |
| D | 2438 | 7.9% |
| I | 2394 | 7.8% |
| N | 2162 | 7.0% |
| G | 2058 | 6.7% |
| W | 1323 | 4.3% |
| U | 1283 | 4.2% |
| M | 1283 | 4.2% |
| Other values (10) | 3371 |
Space Separator
| Value | Count | Frequency (%) |
| (space) | 1794 | 100.0% |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 30774 | |
| Common | 1794 | 5.5% |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| A | 6690 | |
| R | 4079 | |
| H | 3693 | |
| D | 2438 | 7.9% |
| I | 2394 | 7.8% |
| N | 2162 | 7.0% |
| G | 2058 | 6.7% |
| W | 1323 | 4.3% |
| U | 1283 | 4.2% |
| M | 1283 | 4.2% |
| Other values (10) | 3371 |
Common
| Value | Count | Frequency (%) |
| (space) | 1794 | 100.0% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 32568 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| A | 6690 | |
| R | 4079 | |
| H | 3693 | |
| D | 2438 | 7.5% |
| I | 2394 | 7.4% |
| N | 2162 | 6.6% |
| G | 2058 | 6.3% |
| (space) | 1794 | 5.5% |
| W | 1323 | 4.1% |
| U | 1283 | 3.9% |
| Other values (11) | 4654 |
Water_Body_Type
Categorical
IMBALANCE 
| Distinct | 6 |
|---|---|
| Distinct (%) | 0.2% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 24.3 KiB |
| Ponds | 2514 |
|---|---|
| Tank | 461 |
| Lakes | 48 |
| Water consv schemes/percolation tanks/check-dams | 41 |
| Reservoirs | 27 |
Length
| Max length | 48 |
|---|---|
| Median length | 5 |
| Mean length | 5.4657623 |
| Min length | 4 |
Characters and Unicode
| Total characters | 16922 |
|---|---|
| Distinct characters | 25 |
| Distinct categories | 5 ? |
| Distinct scripts | 2 ? |
| Distinct blocks | 1 ? |
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | Ponds |
|---|---|
| 2nd row | Ponds |
| 3rd row | Ponds |
| 4th row | Ponds |
| 5th row | Ponds |
Common Values
| Value | Count | Frequency (%) |
| Ponds | 2514 | 81.2% |
| Tank | 461 | 14.9% |
| Lakes | 48 | 1.6% |
| Water consv schemes/percolation tanks/check-dams | 41 | 1.3% |
| Reservoirs | 27 | 0.9% |
| Others | 5 | 0.2% |
Words
| Value | Count | Frequency (%) |
| ponds | 2514 | |
| tank | 461 | 14.3% |
| lakes | 48 | 1.5% |
| water | 41 | 1.3% |
| consv | 41 | 1.3% |
| schemes/percolation | 41 | 1.3% |
| tanks/check-dams | 41 | 1.3% |
| reservoirs | 27 | 0.8% |
| others | 5 | 0.2% |
Most occurring characters
| Value | Count | Frequency (%) |
| n | 3098 | |
| s | 2826 | |
| o | 2664 | |
| d | 2555 | |
| P | 2514 | |
| a | 673 | 4.0% |
| k | 591 | 3.5% |
| T | 461 | 2.7% |
| e | 312 | 1.8% |
| c | 205 | 1.2% |
| Other values (15) | 1023 | 6.0% |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 13580 | |
| Uppercase Letter | 3096 | 18.3% |
| Space Separator | 123 | 0.7% |
| Other Punctuation | 82 | 0.5% |
| Dash Punctuation | 41 | 0.2% |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| n | 3098 | |
| s | 2826 | |
| o | 2664 | |
| d | 2555 | |
| a | 673 | 5.0% |
| k | 591 | 4.4% |
| e | 312 | 2.3% |
| c | 205 | 1.5% |
| r | 141 | 1.0% |
| t | 128 | 0.9% |
| Other values (6) | 387 | 2.8% |
Uppercase Letter
| Value | Count | Frequency (%) |
| P | 2514 | |
| T | 461 | 14.9% |
| L | 48 | 1.6% |
| W | 41 | 1.3% |
| R | 27 | 0.9% |
| O | 5 | 0.2% |
Space Separator
| Value | Count | Frequency (%) |
| (space) | 123 | 100.0% |
Other Punctuation
| Value | Count | Frequency (%) |
| / | 82 |
Dash Punctuation
| Value | Count | Frequency (%) |
| - | 41 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 16676 | |
| Common | 246 | 1.5% |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| n | 3098 | |
| s | 2826 | |
| o | 2664 | |
| d | 2555 | |
| P | 2514 | |
| a | 673 | 4.0% |
| k | 591 | 3.5% |
| T | 461 | 2.8% |
| e | 312 | 1.9% |
| c | 205 | 1.2% |
| Other values (12) | 777 | 4.7% |
Common
| Value | Count | Frequency (%) |
| (space) | 123 | 50.0% |
| / | 82 | 33.3% |
| - | 41 | 16.7% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 16922 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| n | 3098 | |
| s | 2826 | |
| o | 2664 | |
| d | 2555 | |
| P | 2514 | |
| a | 673 | 4.0% |
| k | 591 | 3.5% |
| T | 461 | 2.7% |
| e | 312 | 1.8% |
| c | 205 | 1.2% |
| Other values (15) | 1023 | 6.0% |
Water_body_in_use
Categorical
HIGH CORRELATION 
| Distinct | 2 |
|---|---|
| Distinct (%) | 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 24.3 KiB |
| 1 | 2371 |
|---|---|
| 0 | 725 |
Length
| Max length | 1 |
|---|---|
| Median length | 1 |
| Mean length | 1 |
| Min length | 1 |
Characters and Unicode
| Total characters | 3096 |
|---|---|
| Distinct characters | 2 |
| Distinct categories | 1 ? |
| Distinct scripts | 1 ? |
| Distinct blocks | 1 ? |
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | 0 |
|---|---|
| 2nd row | 0 |
| 3rd row | 0 |
| 4th row | 0 |
| 5th row | 1 |
Common Values
| Value | Count | Frequency (%) |
| 1 | 2371 | 76.6% |
| 0 | 725 | 23.4% |
Words
| Value | Count | Frequency (%) |
| 1 | 2371 | |
| 0 | 725 | 23.4% |
Most occurring characters
| Value | Count | Frequency (%) |
| 1 | 2371 | |
| 0 | 725 | 23.4% |
Most occurring categories
| Value | Count | Frequency (%) |
| Decimal Number | 3096 |
Most frequent character per category
Decimal Number
| Value | Count | Frequency (%) |
| 1 | 2371 | |
| 0 | 725 | 23.4% |
Most occurring scripts
| Value | Count | Frequency (%) |
| Common | 3096 |
Most frequent character per script
Common
| Value | Count | Frequency (%) |
| 1 | 2371 | |
| 0 | 725 | 23.4% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 3096 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| 1 | 2371 | |
| 0 | 725 | 23.4% |
Reason_for_Water_Body_Use
Categorical
HIGH CORRELATION 
| Distinct | 9 |
|---|---|
| Distinct (%) | 0.3% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 24.3 KiB |
| Ground water recharge | 1267 |
|---|---|
| other | 725 |
| Pisciculture | 611 |
| Irrigation | 322 |
| Other | 101 |
| Other values (4) | 70 |
Length
| Max length | 21 |
|---|---|
| Median length | 17 |
| Mean length | 13.593023 |
| Min length | 5 |
Characters and Unicode
| Total characters | 42084 |
|---|---|
| Distinct characters | 25 |
| Distinct categories | 4 ? |
| Distinct scripts | 2 ? |
| Distinct blocks | 1 ? |
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | other |
|---|---|
| 2nd row | other |
| 3rd row | other |
| 4th row | other |
| 5th row | Ground water recharge |
Common Values
| Value | Count | Frequency (%) |
| Ground water recharge | 1267 | 40.9% |
| other | 725 | 23.4% |
| Pisciculture | 611 | 19.7% |
| Irrigation | 322 | 10.4% |
| Other | 101 | 3.3% |
| Recreation | 26 | 0.8% |
| Religious | 17 | 0.5% |
| Domestic/Drinking | 16 | 0.5% |
| Industrial | 11 | 0.4% |
Words
| Value | Count | Frequency (%) |
| ground | 1267 | |
| water | 1267 | |
| recharge | 1267 | |
| other | 826 | |
| pisciculture | 611 | |
| irrigation | 322 | 5.7% |
| recreation | 26 | 0.5% |
| religious | 17 | 0.3% |
| domestic/drinking | 16 | 0.3% |
| industrial | 11 | 0.2% |
Most occurring characters
| Value | Count | Frequency (%) |
| r | 7202 | |
| e | 5323 | |
| t | 3079 | 7.3% |
| a | 2893 | 6.9% |
| (space) | 2534 | 6.0% |
| c | 2531 | 6.0% |
| u | 2517 | 6.0% |
| o | 2373 | 5.6% |
| h | 2093 | 5.0% |
| i | 1985 | 4.7% |
| Other values (15) | 9554 |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 37147 | |
| Space Separator | 2534 | 6.0% |
| Uppercase Letter | 2387 | 5.7% |
| Other Punctuation | 16 | < 0.1% |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| r | 7202 | |
| e | 5323 | |
| t | 3079 | |
| a | 2893 | |
| c | 2531 | 6.8% |
| u | 2517 | 6.8% |
| o | 2373 | 6.4% |
| h | 2093 | 5.6% |
| i | 1985 | 5.3% |
| n | 1658 | 4.5% |
| Other values (7) | 5493 |
Uppercase Letter
| Value | Count | Frequency (%) |
| G | 1267 | |
| P | 611 | |
| I | 333 | 14.0% |
| O | 101 | 4.2% |
| R | 43 | 1.8% |
| D | 32 | 1.3% |
Space Separator
| Value | Count | Frequency (%) |
| (space) | 2534 | 100.0% |
Other Punctuation
| Value | Count | Frequency (%) |
| / | 16 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 39534 | |
| Common | 2550 | 6.1% |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| r | 7202 | |
| e | 5323 | |
| t | 3079 | 7.8% |
| a | 2893 | 7.3% |
| c | 2531 | 6.4% |
| u | 2517 | 6.4% |
| o | 2373 | 6.0% |
| h | 2093 | 5.3% |
| i | 1985 | 5.0% |
| n | 1658 | 4.2% |
| Other values (13) | 7880 |
Common
| Value | Count | Frequency (%) |
| (space) | 2534 | 99.4% |
| / | 16 | 0.6% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 42084 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| r | 7202 | |
| e | 5323 | |
| t | 3079 | 7.3% |
| a | 2893 | 6.9% |
| (space) | 2534 | 6.0% |
| c | 2531 | 6.0% |
| u | 2517 | 6.0% |
| o | 2373 | 5.6% |
| h | 2093 | 5.0% |
| i | 1985 | 4.7% |
| Other values (15) | 9554 |
Water_Body_Nature
Categorical
HIGH CORRELATION 
| Distinct | 2 |
|---|---|
| Distinct (%) | 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 24.3 KiB |
| Natural | 1654 |
|---|---|
| Man-made | 1442 |
Length
| Max length | 8 |
|---|---|
| Median length | 7 |
| Mean length | 7.4657623 |
| Min length | 7 |
Characters and Unicode
| Total characters | 23114 |
|---|---|
| Distinct characters | 12 |
| Distinct categories | 3 ? |
| Distinct scripts | 2 ? |
| Distinct blocks | 1 ? |
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | Natural |
|---|---|
| 2nd row | Man-made |
| 3rd row | Man-made |
| 4th row | Man-made |
| 5th row | Natural |
Common Values
| Value | Count | Frequency (%) |
| Natural | 1654 | 53.4% |
| Man-made | 1442 | 46.6% |
Words
| Value | Count | Frequency (%) |
| natural | 1654 | |
| man-made | 1442 |
Most occurring characters
| Value | Count | Frequency (%) |
| a | 6192 | |
| N | 1654 | 7.2% |
| t | 1654 | 7.2% |
| u | 1654 | 7.2% |
| r | 1654 | 7.2% |
| l | 1654 | 7.2% |
| M | 1442 | 6.2% |
| n | 1442 | 6.2% |
| - | 1442 | 6.2% |
| m | 1442 | 6.2% |
| Other values (2) | 2884 |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 18576 | |
| Uppercase Letter | 3096 | 13.4% |
| Dash Punctuation | 1442 | 6.2% |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| a | 6192 | |
| t | 1654 | 8.9% |
| u | 1654 | 8.9% |
| r | 1654 | 8.9% |
| l | 1654 | 8.9% |
| n | 1442 | 7.8% |
| m | 1442 | 7.8% |
| d | 1442 | 7.8% |
| e | 1442 | 7.8% |
Uppercase Letter
| Value | Count | Frequency (%) |
| N | 1654 | |
| M | 1442 |
Dash Punctuation
| Value | Count | Frequency (%) |
| - | 1442 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 21672 | |
| Common | 1442 | 6.2% |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| a | 6192 | |
| N | 1654 | 7.6% |
| t | 1654 | 7.6% |
| u | 1654 | 7.6% |
| r | 1654 | 7.6% |
| l | 1654 | 7.6% |
| M | 1442 | 6.7% |
| n | 1442 | 6.7% |
| m | 1442 | 6.7% |
| d | 1442 | 6.7% |
Common
| Value | Count | Frequency (%) |
| - | 1442 |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 23114 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| a | 6192 | |
| N | 1654 | 7.2% |
| t | 1654 | 7.2% |
| u | 1654 | 7.2% |
| r | 1654 | 7.2% |
| l | 1654 | 7.2% |
| M | 1442 | 6.2% |
| n | 1442 | 6.2% |
| - | 1442 | 6.2% |
| m | 1442 | 6.2% |
| Other values (2) | 2884 |
construcion_year
Real number (ℝ)
HIGH CORRELATION  MISSING 
| Distinct | 42 |
|---|---|
| Distinct (%) | 2.9% |
| Missing | 1654 |
| Missing (%) | 53.4% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 2013.4612 |
| Minimum | 1905 |
| Maximum | 2020 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 24.3 KiB |
Quantile statistics
| Minimum | 1905 |
|---|---|
| 5-th percentile | 2001 |
| Q1 | 2014 |
| median | 2016 |
| Q3 | 2017 |
| 95-th percentile | 2019 |
| Maximum | 2020 |
| Range | 115 |
| Interquartile range (IQR) | 3 |
Descriptive statistics
| Standard deviation | 7.9539063 |
|---|---|
| Coefficient of variation (CV) | 0.0039503649 |
| Kurtosis | 44.119486 |
| Mean | 2013.4612 |
| Median Absolute Deviation (MAD) | 2 |
| Skewness | -5.1630476 |
| Sum | 2903411 |
| Variance | 63.264625 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 2017 | 334 | 10.8% |
| 2016 | 213 | 6.9% |
| 2018 | 173 | 5.6% |
| 2015 | 157 | 5.1% |
| 2014 | 118 | 3.8% |
| 2019 | 72 | 2.3% |
| 2002 | 44 | 1.4% |
| 2013 | 40 | 1.3% |
| 2010 | 40 | 1.3% |
| 2005 | 35 | 1.1% |
| Other values (32) | 216 | 7.0% |
| (Missing) | 1654 | 53.4% |
| Value | Count | Frequency (%) |
| 1905 | 1 | |
| 1936 | 1 | |
| 1947 | 1 | |
| 1950 | 2 | |
| 1960 | 1 | |
| 1962 | 1 | |
| 1965 | 1 | |
| 1967 | 1 | |
| 1970 | 1 | |
| 1972 | 1 |
| Value | Count | Frequency (%) |
| 2020 | 20 | 0.6% |
| 2019 | 72 | 2.3% |
| 2018 | 173 | |
| 2017 | 334 | |
| 2016 | 213 | |
| 2015 | 157 | |
| 2014 | 118 | 3.8% |
| 2013 | 40 | 1.3% |
| 2012 | 34 | 1.1% |
| 2011 | 17 | 0.5% |
construction_cost
Real number (ℝ)
HIGH CORRELATION  MISSING  SKEWED 
| Distinct | 184 |
|---|---|
| Distinct (%) | 12.8% |
| Missing | 1654 |
| Missing (%) | 53.4% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 12413710 |
| Minimum | 0 |
| Maximum | 8.39 × 10^9 |
| Zeros | 3 |
| Zeros (%) | 0.1% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 24.3 KiB |
Quantile statistics
| Minimum | 0 |
|---|---|
| 5-th percentile | 24050 |
| Q1 | 61250 |
| median | 90000 |
| Q3 | 200000 |
| 95-th percentile | 437850 |
| Maximum | 8.39 × 10^9 |
| Range | 8.39 × 10^9 |
| Interquartile range (IQR) | 138750 |
Descriptive statistics
| Standard deviation | 2.4528547 × 10^8 |
|---|---|
| Coefficient of variation (CV) | 19.759239 |
| Kurtosis | 959.95713 |
| Mean | 12413710 |
| Median Absolute Deviation (MAD) | 40000 |
| Skewness | 29.29372 |
| Sum | 1.790057 × 10^10 |
| Variance | 6.016496 × 10^16 |
| Monotonicity | Not monotonic |
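The summary figures for construction_cost are internally consistent, which helps when reading the large exponents. The sketch below uses only values copied from this section and checks that the coefficient of variation equals the standard deviation over the mean, that the sum equals the mean times the number of non-missing rows, and that the variance equals the squared standard deviation. The variable names are illustrative.

```python
# Values copied from the construction_cost section of the report.
n_observations = 3096
n_missing = 1654
mean = 12413710
std = 2.4528547e8
reported_cv = 19.759239
reported_sum = 1.790057e10
reported_variance = 6.016496e16

# Statistics are computed over the non-missing rows only.
n_present = n_observations - n_missing

cv = std / mean                 # coefficient of variation
implied_sum = mean * n_present  # sum = mean x count
implied_variance = std ** 2     # variance = std squared

print(f"non-missing rows: {n_present}")
print(f"CV: {cv:.4f} (reported {reported_cv})")
print(f"sum: {implied_sum:.6e} (reported {reported_sum:.6e})")
print(f"variance: {implied_variance:.6e} (reported {reported_variance:.6e})")
```

A standard deviation roughly twenty times the mean, alongside a median of 90000, indicates that a handful of very large values dominate the mean and the sum.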
| Value | Count | Frequency (%) |
| 100000 | 161 | 5.2% |
| 80000 | 145 | 4.7% |
| 90000 | 112 | 3.6% |
| 200000 | 104 | 3.4% |
| 50000 | 92 | 3.0% |
| 150000 | 75 | 2.4% |
| 40000 | 65 | 2.1% |
| 250000 | 64 | 2.1% |
| 300000 | 50 | 1.6% |
| 60000 | 48 | 1.6% |
| Other values (174) | 526 | 17.0% |
| (Missing) | 1654 | 53.4% |
| Value | Count | Frequency (%) |
| 0 | 3 | 0.1% |
| 1 | 19 | |
| 2 | 1 | < 0.1% |
| 50 | 3 | 0.1% |
| 1850 | 1 | < 0.1% |
| 6000 | 1 | < 0.1% |
| 10000 | 5 | 0.2% |
| 11000 | 1 | < 0.1% |
| 12000 | 4 | 0.1% |
| 14000 | 1 | < 0.1% |
| Value | Count | Frequency (%) |
| 8390000000 | 1 | |
| 2466900000 | 1 | |
| 2400000000 | 1 | |
| 1500000000 | 1 | |
| 1200000000 | 1 | |
| 900000000 | 1 | |
| 380400000 | 1 | |
| 168316000 | 1 | |
| 73800000 | 1 | |
| 36000000 | 1 |
Renovation_Year
Real number (ℝ)
HIGH CORRELATION  MISSING 
| Distinct | 19 |
|---|---|
| Distinct (%) | 6.2% |
| Missing | 2789 |
| Missing (%) | 90.1% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 2013.7101 |
| Minimum | 1995 |
| Maximum | 2020 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 24.3 KiB |
Quantile statistics
| Minimum | 1995 |
|---|---|
| 5-th percentile | 2008 |
| Q1 | 2012 |
| median | 2015 |
| Q3 | 2017 |
| 95-th percentile | 2018 |
| Maximum | 2020 |
| Range | 25 |
| Interquartile range (IQR) | 5 |
Descriptive statistics
| Standard deviation | 3.7838661 |
|---|---|
| Coefficient of variation (CV) | 0.0018790521 |
| Kurtosis | 2.2060216 |
| Mean | 2013.7101 |
| Median Absolute Deviation (MAD) | 3 |
| Skewness | -1.0220579 |
| Sum | 618209 |
| Variance | 14.317643 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 2012 | 67 | 2.2% |
| 2017 | 61 | 2.0% |
| 2016 | 39 | 1.3% |
| 2010 | 31 | 1.0% |
| 2015 | 23 | 0.7% |
| 2008 | 21 | 0.7% |
| 2018 | 18 | 0.6% |
| 2019 | 10 | 0.3% |
| 2013 | 9 | 0.3% |
| 2011 | 6 | 0.2% |
| Other values (9) | 22 | 0.7% |
| (Missing) | 2789 | 90.1% |
| Value | Count | Frequency (%) |
| 1995 | 1 | < 0.1% |
| 1998 | 1 | < 0.1% |
| 2000 | 1 | < 0.1% |
| 2002 | 1 | < 0.1% |
| 2005 | 3 | 0.1% |
| 2006 | 2 | 0.1% |
| 2008 | 21 | |
| 2009 | 2 | 0.1% |
| 2010 | 31 | |
| 2011 | 6 | 0.2% |
| Value | Count | Frequency (%) |
| 2020 | 5 | 0.2% |
| 2019 | 10 | 0.3% |
| 2018 | 18 | 0.6% |
| 2017 | 61 | |
| 2016 | 39 | |
| 2015 | 23 | 0.7% |
| 2014 | 6 | 0.2% |
| 2013 | 9 | 0.3% |
| 2012 | 67 | |
| 2011 | 6 | 0.2% |
renovation_cost
Real number (ℝ)
HIGH CORRELATION  MISSING 
| Distinct | 76 |
|---|---|
| Distinct (%) | 24.8% |
| Missing | 2789 |
| Missing (%) | 90.1% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 352597.3 |
| Minimum | 1000 |
| Maximum | 46300000 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 24.3 KiB |
Quantile statistics
| Minimum | 1000 |
|---|---|
| 5-th percentile | 5000 |
| Q1 | 8000 |
| median | 20000 |
| Q3 | 80000 |
| 95-th percentile | 447800 |
| Maximum | 46300000 |
| Range | 46299000 |
| Interquartile range (IQR) | 72000 |
Descriptive statistics
| Standard deviation | 2973615.1 |
|---|---|
| Coefficient of variation (CV) | 8.4334597 |
| Kurtosis | 194.76923 |
| Mean | 352597.3 |
| Median Absolute Deviation (MAD) | 14000 |
| Skewness | 13.365617 |
| Sum | 1.0824737 × 10^8 |
| Variance | 8.8423868 × 10^12 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 10000 | 45 | 1.5% |
| 20000 | 33 | 1.1% |
| 5000 | 33 | 1.1% |
| 8000 | 23 | 0.7% |
| 100000 | 19 | 0.6% |
| 6000 | 12 | 0.4% |
| 25000 | 11 | 0.4% |
| 50000 | 9 | 0.3% |
| 200000 | 8 | 0.3% |
| 3000 | 8 | 0.3% |
| Other values (66) | 106 | 3.4% |
| (Missing) | 2789 | 90.1% |
| Value | Count | Frequency (%) |
| 1000 | 2 | 0.1% |
| 2000 | 2 | 0.1% |
| 3000 | 8 | 0.3% |
| 4000 | 1 | < 0.1% |
| 5000 | 33 | |
| 5600 | 1 | < 0.1% |
| 6000 | 12 | 0.4% |
| 7000 | 1 | < 0.1% |
| 8000 | 23 | |
| 8010 | 1 | < 0.1% |
| Value | Count | Frequency (%) |
| 46300000 | 1 | |
| 20500000 | 1 | |
| 12000000 | 1 | |
| 4650304 | 1 | |
| 2000000 | 1 | |
| 1442000 | 1 | |
| 1390000 | 1 | |
| 900000 | 1 | |
| 852000 | 1 | |
| 780000 | 1 |
Repair_Renovation_Status
Categorical
IMBALANCE 
| Distinct | 2 |
|---|---|
| Distinct (%) | 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 24.3 KiB |
| 0 | 3087 |
|---|---|
| 1 | 9 |
Length
| Max length | 1 |
|---|---|
| Median length | 1 |
| Mean length | 1 |
| Min length | 1 |
Characters and Unicode
| Total characters | 3096 |
|---|---|
| Distinct characters | 2 |
| Distinct categories | 1 ? |
| Distinct scripts | 1 ? |
| Distinct blocks | 1 ? |
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | 0 |
|---|---|
| 2nd row | 0 |
| 3rd row | 0 |
| 4th row | 0 |
| 5th row | 0 |
Common Values
| Value | Count | Frequency (%) |
| 0 | 3087 | 99.7% |
| 1 | 9 | 0.3% |
Words
| Value | Count | Frequency (%) |
| 0 | 3087 | |
| 1 | 9 | 0.3% |
Most occurring characters
| Value | Count | Frequency (%) |
| 0 | 3087 | |
| 1 | 9 | 0.3% |
Most occurring categories
| Value | Count | Frequency (%) |
| Decimal Number | 3096 |
Most frequent character per category
Decimal Number
| Value | Count | Frequency (%) |
| 0 | 3087 | |
| 1 | 9 | 0.3% |
Most occurring scripts
| Value | Count | Frequency (%) |
| Common | 3096 |
Most frequent character per script
Common
| Value | Count | Frequency (%) |
| 0 | 3087 | |
| 1 | 9 | 0.3% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 3096 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| 0 | 3087 | |
| 1 | 9 | 0.3% |
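The IMBALANCE flag on Repair_Renovation_Status follows from the two counts above (3087 zeros against 9 ones). As a minimal sketch, the class shares and a normalized-entropy imbalance score can be recomputed from those counts alone. The score formula used here (1 minus Shannon entropy divided by its maximum) is an assumption chosen for illustration; the report does not state how its flag is computed.

```python
import math

# Counts copied from the Common Values table for Repair_Renovation_Status.
counts = {"0": 3087, "1": 9}
total = sum(counts.values())

# Share of each class as a percentage, rounded to one decimal place.
shares = {value: round(100 * n / total, 1) for value, n in counts.items()}

# Illustrative imbalance score (assumed formula, not taken from the report):
# 0 for perfectly balanced classes, approaching 1 as one class dominates.
probs = [n / total for n in counts.values()]
entropy = -sum(p * math.log2(p) for p in probs if p > 0)
imbalance = 1 - entropy / math.log2(len(counts))

print(shares)
print(round(imbalance, 3))
```

With only 9 positive rows, any model or group comparison built on this column would rest on very few observations.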
Original_Storage_Capacity
Real number (ℝ)
HIGH CORRELATION  SKEWED  ZEROS 
| Distinct | 1346 |
|---|---|
| Distinct (%) | 43.5% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 1817741.5 |
| Minimum | 0 |
|---|---|
| Maximum | 5.3367534 × 10⁹ |
| Zeros | 46 |
| Zeros (%) | 1.5% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 24.3 KiB |
Quantile statistics
| Minimum | 0 |
|---|---|
| 5-th percentile | 12 |
| Q1 | 150 |
| median | 1849 |
| Q3 | 6000 |
| 95-th percentile | 27000 |
| Maximum | 5.3367534 × 10⁹ |
| Range | 5.3367534 × 10⁹ |
| Interquartile range (IQR) | 5850 |
Descriptive statistics
| Standard deviation | 95942479 |
|---|---|
| Coefficient of variation (CV) | 52.781146 |
| Kurtosis | 3091.9521 |
| Mean | 1817741.5 |
| Median Absolute Deviation (MAD) | 1793 |
| Skewness | 55.587691 |
| Sum | 5.6277276 × 10⁹ |
| Variance | 9.2049593 × 10¹⁵ |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 12 | 109 | 3.5% |
| 50 | 55 | 1.8% |
| 40 | 48 | 1.6% |
| 0 | 46 | 1.5% |
| 300 | 44 | 1.4% |
| 200 | 42 | 1.4% |
| 100 | 39 | 1.3% |
| 2400 | 38 | 1.2% |
| 30 | 34 | 1.1% |
| 60 | 32 | 1.0% |
| Other values (1336) | 2609 | 84.3% |
| Value | Count | Frequency (%) |
| 0 | 46 | 1.5% |
| 1 | 3 | 0.1% |
| 2 | 2 | 0.1% |
| 4 | 1 | < 0.1% |
| 9 | 1 | < 0.1% |
| 10 | 1 | < 0.1% |
| 11 | 1 | < 0.1% |
| 12 | 109 | 3.5% |
| 15 | 3 | 0.1% |
| 16 | 3 | 0.1% |
| Value | Count | Frequency (%) |
| 5336753400 | 1 | < 0.1% |
| 97560000 | 1 | < 0.1% |
| 67680000 | 1 | < 0.1% |
| 59820000 | 1 | < 0.1% |
| 30000002 | 1 | < 0.1% |
| 7504000 | 1 | < 0.1% |
| 1600000 | 1 | < 0.1% |
| 1352000 | 1 | < 0.1% |
| 985159 | 1 | < 0.1% |
| 708300 | 1 | < 0.1% |
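Several of the figures reported for Original_Storage_Capacity are derived from others in the same tables. The sketch below recomputes them from the printed values using the standard definitions (CV as standard deviation over mean, IQR as Q3 minus Q1, variance as the squared standard deviation). Small differences from the report are expected because the inputs are already rounded.

```python
# Figures copied from the Original_Storage_Capacity tables above.
mean = 1817741.5
std = 95942479
q1, q3 = 150, 6000
minimum, maximum = 0, 5336753400

cv = std / mean                  # coefficient of variation
iqr = q3 - q1                    # interquartile range
value_range = maximum - minimum  # range
variance = std ** 2              # variance is the squared standard deviation

print(f"CV = {cv:.4f}")
print(f"IQR = {iqr}")
print(f"Range = {value_range:.4e}")
print(f"Variance = {variance:.4e}")
```

A CV above 50 alongside a median of 1849 and a maximum above 5 × 10⁹ indicates that the mean and standard deviation are dominated by a handful of extreme rows.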
Present_Storage_Capacity
Real number (ℝ)
HIGH CORRELATION  SKEWED  ZEROS 
| Distinct | 1176 |
|---|---|
| Distinct (%) | 38.0% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 899714.91 |
| Minimum | 0 |
|---|---|
| Maximum | 2.6683767 × 10⁹ |
| Zeros | 46 |
| Zeros (%) | 1.5% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 24.3 KiB |
Quantile statistics
| Minimum | 0 |
|---|---|
| 5-th percentile | 10 |
| Q1 | 100 |
| median | 1031.5 |
| Q3 | 3550.75 |
| 95-th percentile | 18000 |
| Maximum | 2.6683767 × 10⁹ |
| Range | 2.6683767 × 10⁹ |
| Interquartile range (IQR) | 3450.75 |
Descriptive statistics
| Standard deviation | 47969255 |
|---|---|
| Coefficient of variation (CV) | 53.316061 |
| Kurtosis | 3092.507 |
| Mean | 899714.91 |
| Median Absolute Deviation (MAD) | 991.5 |
| Skewness | 55.595271 |
| Sum | 2.7855174 × 10⁹ |
| Variance | 2.3010494 × 10¹⁵ |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 10 | 142 | 4.6% |
| 50 | 64 | 2.1% |
| 100 | 63 | 2.0% |
| 0 | 46 | 1.5% |
| 20 | 45 | 1.5% |
| 1200 | 45 | 1.5% |
| 200 | 43 | 1.4% |
| 40 | 42 | 1.4% |
| 300 | 37 | 1.2% |
| 1 | 35 | 1.1% |
| Other values (1166) | 2534 | 81.8% |
| Value | Count | Frequency (%) |
| 0 | 46 | 1.5% |
| 1 | 35 | 1.1% |
| 2 | 3 | 0.1% |
| 3 | 1 | < 0.1% |
| 5 | 9 | 0.3% |
| 6 | 1 | < 0.1% |
| 8 | 3 | 0.1% |
| 10 | 142 | 4.6% |
| 12 | 1 | < 0.1% |
| 14 | 1 | < 0.1% |
| Value | Count | Frequency (%) |
| 2668376700 | 1 | < 0.1% |
| 59820000 | 1 | < 0.1% |
| 16260000 | 1 | < 0.1% |
| 11280000 | 1 | < 0.1% |
| 6015600 | 1 | < 0.1% |
| 2500000 | 1 | < 0.1% |
| 1500000 | 1 | < 0.1% |
| 1352000 | 1 | < 0.1% |
| 895599 | 1 | < 0.1% |
| 700000 | 1 | < 0.1% |
filled_up_storage_name
Categorical
HIGH CORRELATION  MISSING 
| Distinct | 5 |
|---|---|
| Distinct (%) | 0.2% |
| Missing | 46 |
| Missing (%) | 1.5% |
| Memory size | 24.3 KiB |
| Full | 1396 |
|---|---|
| Upto 1/2 | 607 |
| Upto 3/4 | 520 |
| Nil/Negligible filled up | 302 |
| Upto 1/4 | 225 |
Length
| Max length | 24 |
|---|---|
| Median length | 8 |
| Mean length | 7.7534426 |
| Min length | 4 |
Characters and Unicode
| Total characters | 23648 |
|---|---|
| Distinct characters | 20 |
| Distinct categories | 5 |
| Distinct scripts | 2 |
| Distinct blocks | 1 |
Unique
| Unique | 0 |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | Nil/Negligible filled up |
|---|---|
| 2nd row | Nil/Negligible filled up |
| 3rd row | Upto 1/4 |
| 4th row | Upto 1/2 |
| 5th row | Nil/Negligible filled up |
Common Values
| Value | Count | Frequency (%) |
| Full | 1396 | 45.1% |
| Upto 1/2 | 607 | 19.6% |
| Upto 3/4 | 520 | 16.8% |
| Nil/Negligible filled up | 302 | 9.8% |
| Upto 1/4 | 225 | 7.3% |
| (Missing) | 46 | 1.5% |
Length
Common Values (Plot)
| Value | Count | Frequency (%) |
| full | 1396 | 27.9% |
| upto | 1352 | 27.0% |
| 1/2 | 607 | 12.1% |
| 3/4 | 520 | 10.4% |
| nil/negligible | 302 | 6.0% |
| filled | 302 | 6.0% |
| up | 302 | 6.0% |
| 1/4 | 225 | 4.5% |
Most occurring characters
| Value | Count | Frequency (%) |
| l | 4302 | 18.2% |
| (space) | 1956 | 8.3% |
| u | 1698 | 7.2% |
| p | 1654 | 7.0% |
| / | 1654 | 7.0% |
| F | 1396 | 5.9% |
| U | 1352 | 5.7% |
| t | 1352 | 5.7% |
| o | 1352 | 5.7% |
| i | 1208 | 5.1% |
| Other values (10) | 5724 | 24.2% |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 13982 | 59.1% |
| Uppercase Letter | 3352 | 14.2% |
| Decimal Number | 2704 | 11.4% |
| Space Separator | 1956 | 8.3% |
| Other Punctuation | 1654 | 7.0% |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| l | 4302 | 30.8% |
| u | 1698 | 12.1% |
| p | 1654 | 11.8% |
| t | 1352 | 9.7% |
| o | 1352 | 9.7% |
| i | 1208 | 8.6% |
| e | 906 | 6.5% |
| g | 604 | 4.3% |
| b | 302 | 2.2% |
| f | 302 | 2.2% |
Decimal Number
| Value | Count | Frequency (%) |
| 1 | 832 | 30.8% |
| 4 | 745 | 27.6% |
| 2 | 607 | 22.4% |
| 3 | 520 | 19.2% |
Uppercase Letter
| Value | Count | Frequency (%) |
| F | 1396 | 41.6% |
| U | 1352 | 40.3% |
| N | 604 | 18.0% |
Space Separator
| Value | Count | Frequency (%) |
| (space) | 1956 | 100.0% |
Other Punctuation
| Value | Count | Frequency (%) |
| / | 1654 | 100.0% |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 17334 | 73.3% |
| Common | 6314 | 26.7% |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| l | 4302 | 24.8% |
| u | 1698 | 9.8% |
| p | 1654 | 9.5% |
| F | 1396 | 8.1% |
| U | 1352 | 7.8% |
| t | 1352 | 7.8% |
| o | 1352 | 7.8% |
| i | 1208 | 7.0% |
| e | 906 | 5.2% |
| N | 604 | 3.5% |
| Other values (4) | 1510 | 8.7% |
Common
| Value | Count | Frequency (%) |
| (space) | 1956 | 31.0% |
| / | 1654 | 26.2% |
| 1 | 832 | 13.2% |
| 4 | 745 | 11.8% |
| 2 | 607 | 9.6% |
| 3 | 520 | 8.2% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 23648 | 100.0% |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| l | 4302 | 18.2% |
| (space) | 1956 | 8.3% |
| u | 1698 | 7.2% |
| p | 1654 | 7.0% |
| / | 1654 | 7.0% |
| F | 1396 | 5.9% |
| U | 1352 | 5.7% |
| t | 1352 | 5.7% |
| o | 1352 | 5.7% |
| i | 1208 | 5.1% |
| Other values (10) | 5724 | 24.2% |
filled_up_storage_space_name
Categorical
HIGH CORRELATION  MISSING 
| Distinct | 4 |
|---|---|
| Distinct (%) | 0.1% |
| Missing | 46 |
| Missing (%) | 1.5% |
| Memory size | 24.3 KiB |
| Filled up every year | 1336 |
|---|---|
| Usually filled up | 1002 |
| Rarely filled up | 577 |
| Never filled up | 135 |
Length
| Max length | 20 |
|---|---|
| Median length | 17 |
| Mean length | 18.036393 |
| Min length | 15 |
Characters and Unicode
| Total characters | 55011 |
|---|---|
| Distinct characters | 17 |
| Distinct categories | 3 |
| Distinct scripts | 2 |
| Distinct blocks | 1 |
Unique
| Unique | 0 |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | Rarely filled up |
|---|---|
| 2nd row | Rarely filled up |
| 3rd row | Rarely filled up |
| 4th row | Rarely filled up |
| 5th row | Never filled up |
Common Values
| Value | Count | Frequency (%) |
| Filled up every year | 1336 | 43.2% |
| Usually filled up | 1002 | 32.4% |
| Rarely filled up | 577 | 18.6% |
| Never filled up | 135 | 4.4% |
| (Missing) | 46 | 1.5% |
Length
Common Values (Plot)
| Value | Count | Frequency (%) |
| filled | 3050 | 29.1% |
| up | 3050 | 29.1% |
| every | 1336 | 12.7% |
| year | 1336 | 12.7% |
| usually | 1002 | 9.6% |
| rarely | 577 | 5.5% |
| never | 135 | 1.3% |
Most occurring characters
| Value | Count | Frequency (%) |
| l | 8681 | 15.8% |
| e | 7905 | 14.4% |
| (space) | 7436 | 13.5% |
| y | 4251 | 7.7% |
| u | 4052 | 7.4% |
| r | 3384 | 6.2% |
| d | 3050 | 5.5% |
| p | 3050 | 5.5% |
| i | 3050 | 5.5% |
| a | 2915 | 5.3% |
| Other values (7) | 7237 | 13.2% |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 44525 | 80.9% |
| Space Separator | 7436 | 13.5% |
| Uppercase Letter | 3050 | 5.5% |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| l | 8681 | 19.5% |
| e | 7905 | 17.8% |
| y | 4251 | 9.5% |
| u | 4052 | 9.1% |
| r | 3384 | 7.6% |
| d | 3050 | 6.9% |
| p | 3050 | 6.9% |
| i | 3050 | 6.9% |
| a | 2915 | 6.5% |
| f | 1714 | 3.8% |
| Other values (2) | 2473 | 5.6% |
Uppercase Letter
| Value | Count | Frequency (%) |
| F | 1336 | 43.8% |
| U | 1002 | 32.9% |
| R | 577 | 18.9% |
| N | 135 | 4.4% |
Space Separator
| Value | Count | Frequency (%) |
| (space) | 7436 | 100.0% |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 47575 | 86.5% |
| Common | 7436 | 13.5% |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| l | 8681 | 18.2% |
| e | 7905 | 16.6% |
| y | 4251 | 8.9% |
| u | 4052 | 8.5% |
| r | 3384 | 7.1% |
| d | 3050 | 6.4% |
| p | 3050 | 6.4% |
| i | 3050 | 6.4% |
| a | 2915 | 6.1% |
| f | 1714 | 3.6% |
| Other values (6) | 5523 | 11.6% |
Common
| Value | Count | Frequency (%) |
| (space) | 7436 | 100.0% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 55011 | 100.0% |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| l | 8681 | 15.8% |
| e | 7905 | 14.4% |
| (space) | 7436 | 13.5% |
| y | 4251 | 7.7% |
| u | 4052 | 7.4% |
| r | 3384 | 6.2% |
| d | 3050 | 5.5% |
| p | 3050 | 5.5% |
| i | 3050 | 5.5% |
| a | 2915 | 5.3% |
| Other values (7) | 7237 | 13.2% |
no_people_benefited_by_water_body
Real number (ℝ)
HIGH CORRELATION  SKEWED 
| Distinct | 95 |
|---|---|
| Distinct (%) | 3.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 191.28941 |
| Minimum | 1 |
|---|---|
| Maximum | 97586 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 24.3 KiB |
Quantile statistics
| Minimum | 1 |
|---|---|
| 5-th percentile | 1 |
| Q1 | 3 |
| median | 7 |
| Q3 | 15 |
| 95-th percentile | 60 |
| Maximum | 97586 |
| Range | 97585 |
| Interquartile range (IQR) | 12 |
Descriptive statistics
| Standard deviation | 3440.2533 |
|---|---|
| Coefficient of variation (CV) | 17.984547 |
| Kurtosis | 550.80154 |
| Mean | 191.28941 |
| Median Absolute Deviation (MAD) | 5 |
| Skewness | 22.836013 |
| Sum | 592232 |
| Variance | 11835343 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 1 | 435 | 14.1% |
| 10 | 324 | 10.5% |
| 4 | 303 | 9.8% |
| 2 | 270 | 8.7% |
| 3 | 267 | 8.6% |
| 25 | 203 | 6.6% |
| 5 | 169 | 5.5% |
| 20 | 128 | 4.1% |
| 15 | 123 | 4.0% |
| 8 | 114 | 3.7% |
| Other values (85) | 760 | 24.5% |
| Value | Count | Frequency (%) |
| 1 | 435 | 14.1% |
| 2 | 270 | 8.7% |
| 3 | 267 | 8.6% |
| 4 | 303 | 9.8% |
| 5 | 169 | 5.5% |
| 6 | 92 | 3.0% |
| 7 | 63 | 2.0% |
| 8 | 114 | 3.7% |
| 9 | 15 | 0.5% |
| 10 | 324 | 10.5% |
| Value | Count | Frequency (%) |
| 97586 | 1 | < 0.1% |
| 90000 | 1 | < 0.1% |
| 80000 | 1 | < 0.1% |
| 72350 | 1 | < 0.1% |
| 54895 | 1 | < 0.1% |
| 50000 | 1 | < 0.1% |
| 35513 | 1 | < 0.1% |
| 22000 | 1 | < 0.1% |
| 12000 | 1 | < 0.1% |
| 5000 | 2 | 0.1% |
reason_water_body_in_use_name2
Categorical
MISSING 
| Distinct | 8 |
|---|---|
| Distinct (%) | 0.9% |
| Missing | 2174 |
| Missing (%) | 70.2% |
| Memory size | 24.3 KiB |
| Ground water recharge | 529 |
|---|---|
| Other | 129 |
| Pisciculture | 90 |
| Recreation | 64 |
| Domestic/Drinking | 50 |
| Other values (3) | 60 |
Length
| Max length | 21 |
|---|---|
| Median length | 21 |
| Mean length | 16.168113 |
| Min length | 5 |
Characters and Unicode
| Total characters | 14907 |
|---|---|
| Distinct characters | 25 |
| Distinct categories | 4 |
| Distinct scripts | 2 |
| Distinct blocks | 1 |
Unique
| Unique | 0 |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | Other |
|---|---|
| 2nd row | Ground water recharge |
| 3rd row | Ground water recharge |
| 4th row | Other |
| 5th row | Irrigation |
Common Values
| Value | Count | Frequency (%) |
| Ground water recharge | 529 | 17.1% |
| Other | 129 | 4.2% |
| Pisciculture | 90 | 2.9% |
| Recreation | 64 | 2.1% |
| Domestic/Drinking | 50 | 1.6% |
| Irrigation | 35 | 1.1% |
| Religious | 17 | 0.5% |
| Industrial | 8 | 0.3% |
| (Missing) | 2174 | 70.2% |
Length
Common Values (Plot)
| Value | Count | Frequency (%) |
| ground | 529 | 26.7% |
| water | 529 | 26.7% |
| recharge | 529 | 26.7% |
| other | 129 | 6.5% |
| pisciculture | 90 | 4.5% |
| recreation | 64 | 3.2% |
| domestic/drinking | 50 | 2.5% |
| irrigation | 35 | 1.8% |
| religious | 17 | 0.9% |
| industrial | 8 | 0.4% |
Most occurring characters
| Value | Count | Frequency (%) |
| r | 2527 | 17.0% |
| e | 2001 | 13.4% |
| a | 1165 | 7.8% |
| (space) | 1058 | 7.1% |
| t | 905 | 6.1% |
| c | 823 | 5.5% |
| n | 736 | 4.9% |
| u | 734 | 4.9% |
| o | 695 | 4.7% |
| h | 658 | 4.4% |
| Other values (15) | 3605 | 24.2% |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 12827 | 86.0% |
| Space Separator | 1058 | 7.1% |
| Uppercase Letter | 972 | 6.5% |
| Other Punctuation | 50 | 0.3% |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| r | 2527 | 19.7% |
| e | 2001 | 15.6% |
| a | 1165 | 9.1% |
| t | 905 | 7.1% |
| c | 823 | 6.4% |
| n | 736 | 5.7% |
| u | 734 | 5.7% |
| o | 695 | 5.4% |
| h | 658 | 5.1% |
| g | 631 | 4.9% |
| Other values (7) | 1952 | 15.2% |
Uppercase Letter
| Value | Count | Frequency (%) |
| G | 529 | 54.4% |
| O | 129 | 13.3% |
| D | 100 | 10.3% |
| P | 90 | 9.3% |
| R | 81 | 8.3% |
| I | 43 | 4.4% |
Space Separator
| Value | Count | Frequency (%) |
| (space) | 1058 | 100.0% |
Other Punctuation
| Value | Count | Frequency (%) |
| / | 50 | 100.0% |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 13799 | 92.6% |
| Common | 1108 | 7.4% |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| r | 2527 | 18.3% |
| e | 2001 | 14.5% |
| a | 1165 | 8.4% |
| t | 905 | 6.6% |
| c | 823 | 6.0% |
| n | 736 | 5.3% |
| u | 734 | 5.3% |
| o | 695 | 5.0% |
| h | 658 | 4.8% |
| g | 631 | 4.6% |
| Other values (13) | 2924 | 21.2% |
Common
| Value | Count | Frequency (%) |
| (space) | 1058 | 95.5% |
| / | 50 | 4.5% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 14907 | 100.0% |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| r | 2527 | 17.0% |
| e | 2001 | 13.4% |
| a | 1165 | 7.8% |
| (space) | 1058 | 7.1% |
| t | 905 | 6.1% |
| c | 823 | 5.5% |
| n | 736 | 4.9% |
| u | 734 | 4.9% |
| o | 695 | 4.7% |
| h | 658 | 4.4% |
| Other values (15) | 3605 | 24.2% |
storage_capacity_change
Real number (ℝ)
HIGH CORRELATION  SKEWED  ZEROS 
| Distinct | 1120 |
|---|---|
| Distinct (%) | 36.2% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | -918026.57 |
| Minimum | -2.6683767 × 10⁹ |
|---|---|
| Maximum | 11840 |
| Zeros | 518 |
| Zeros (%) | 16.7% |
| Negative | 2567 |
| Negative (%) | 82.9% |
| Memory size | 24.3 KiB |
Quantile statistics
| Minimum | -2.6683767 × 10⁹ |
|---|---|
| 5-th percentile | -9000 |
| Q1 | -1800.5 |
| median | -440 |
| Q3 | -9 |
| 95-th percentile | 0 |
| Maximum | 11840 |
| Range | 2.6683885 × 10⁹ |
| Interquartile range (IQR) | 1791.5 |
Descriptive statistics
| Standard deviation | 47990893 |
|---|---|
| Coefficient of variation (CV) | -52.276148 |
| Kurtosis | 3086.8457 |
| Mean | -918026.57 |
| Median Absolute Deviation (MAD) | 440 |
| Skewness | -55.520423 |
| Sum | -2.8422103 × 10⁹ |
| Variance | 2.3031258 × 10¹⁵ |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 0 | 518 | 16.7% |
| -2 | 121 | 3.9% |
| -10 | 58 | 1.9% |
| -20 | 52 | 1.7% |
| -5 | 45 | 1.5% |
| -1000 | 34 | 1.1% |
| -30 | 26 | 0.8% |
| -500 | 25 | 0.8% |
| -400 | 25 | 0.8% |
| -2000 | 24 | 0.8% |
| Other values (1110) | 2168 | 70.0% |
| Value | Count | Frequency (%) |
| -2668376700 | 1 | < 0.1% |
| -81300000 | 1 | < 0.1% |
| -56400000 | 1 | < 0.1% |
| -27500002 | 1 | < 0.1% |
| -1488400 | 1 | < 0.1% |
| -179800 | 1 | < 0.1% |
| -122400 | 1 | < 0.1% |
| -119900 | 1 | < 0.1% |
| -118608 | 1 | < 0.1% |
| -109200 | 1 | < 0.1% |
| Value | Count | Frequency (%) |
| 11840 | 1 | < 0.1% |
| 7950 | 1 | < 0.1% |
| 3600 | 1 | < 0.1% |
| 2500 | 1 | < 0.1% |
| 1050 | 1 | < 0.1% |
| 600 | 1 | < 0.1% |
| 562 | 1 | < 0.1% |
| 420 | 1 | < 0.1% |
| 48 | 1 | < 0.1% |
| 12 | 1 | < 0.1% |
population_density_benefited
Real number (ℝ)
HIGH CORRELATION  SKEWED 
| Distinct | 215 |
|---|---|
| Distinct (%) | 6.9% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 1.1251011 |
| Minimum | 0 |
|---|---|
| Maximum | 222.36472 |
| Zeros | 3 |
| Zeros (%) | 0.1% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 24.3 KiB |
Quantile statistics
| Minimum | 0 |
|---|---|
| 5-th percentile | 0.1 |
| Q1 | 1.1251011 |
| median | 1.1251011 |
| Q3 | 1.1251011 |
| 95-th percentile | 1.1251011 |
| Maximum | 222.36472 |
| Range | 222.36472 |
| Interquartile range (IQR) | 0 |
Descriptive statistics
| Standard deviation | 3.9905949 |
|---|---|
| Coefficient of variation (CV) | 3.5468768 |
| Kurtosis | 3055.3143 |
| Mean | 1.1251011 |
| Median Absolute Deviation (MAD) | 0 |
| Skewness | 55.092668 |
| Sum | 3483.3129 |
| Variance | 15.924848 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 1.125101077 | 2789 | 90.1% |
| 0.12 | 8 | 0.3% |
| 0.001 | 7 | 0.2% |
| 1 | 7 | 0.2% |
| 0.4 | 7 | 0.2% |
| 0.15 | 7 | 0.2% |
| 0.06 | 5 | 0.2% |
| 0.1 | 5 | 0.2% |
| 0.5 | 5 | 0.2% |
| 2 | 4 | 0.1% |
| Other values (205) | 252 | 8.1% |
| Value | Count | Frequency (%) |
| 0 | 3 | 0.1% |
| 4 × 10⁻⁵ | 1 | < 0.1% |
| 5 × 10⁻⁵ | 1 | < 0.1% |
| 7.5 × 10⁻⁵ | 1 | < 0.1% |
| 8.33333 × 10⁻⁵ | 1 | < 0.1% |
| 9.09091 × 10⁻⁵ | 1 | < 0.1% |
| 0.0001 | 1 | < 0.1% |
| 0.000111111 | 2 | 0.1% |
| 0.000166667 | 1 | < 0.1% |
| 0.000174359 | 1 | < 0.1% |
| Value | Count | Frequency (%) |
| 222.364725 | 1 | < 0.1% |
| 5.00625 | 1 | < 0.1% |
| 4.55 | 1 | < 0.1% |
| 4.273659199 | 1 | < 0.1% |
| 4.02 | 1 | < 0.1% |
| 4 | 2 | 0.1% |
| 2.915 | 1 | < 0.1% |
| 2.5 | 1 | < 0.1% |
| 2.4 | 1 | < 0.1% |
| 2.375 | 1 | < 0.1% |
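In the population_density_benefited tables above, 2789 of the 3096 rows share the single value 1.125101077, which equals the reported mean, and Q1, the median and Q3 all sit on that value, so the IQR and MAD are 0. This pattern is consistent with missing entries having been filled with the column mean, although the report does not say so. The sketch below uses made-up numbers (the `observed` list and `n_missing` are hypothetical, not taken from the dataset) to show why mean-filling produces this signature.

```python
import statistics

# Hypothetical observed values and number of missing entries (not from the dataset).
observed = [0.5, 1.0, 2.0, 4.5]
n_missing = 36

fill_value = statistics.mean(observed)        # mean of the observed values
filled = observed + [fill_value] * n_missing  # column after mean-filling

q1, _, q3 = statistics.quantiles(filled, n=4)  # quartiles of the filled column

print(statistics.mean(filled) == fill_value)    # True if the fill leaves the mean unchanged
print(statistics.median(filled) == fill_value)  # True if the median sits on the fill value
print(q3 - q1)                                  # interquartile range of the filled column
```

If the column was filled this way, its correlations with other fields mostly reflect the roughly 10% of rows that carry a genuine value.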
| | Area_Type | District Name | Original_Storage_Capacity | Present_Storage_Capacity | Reason_for_Water_Body_Use | Renovation_Year | Repair_Renovation_Status | Water_Body_Nature | Water_Body_Type | Water_body_in_use | construcion_year | construction_cost | df_index | filled_up_storage_name | filled_up_storage_space_name | level_0 | no_people_benefited_by_water_body | population_density_benefited | reason_water_body_in_use_name2 | renovation_cost | storage_capacity_change |
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| Area_Type | 1.000 | 0.097 | 0.143 | 0.126 | 0.150 | -0.137 | 0.000 | 0.084 | 0.104 | 0.043 | -0.064 | 0.029 | 0.016 | 0.048 | 0.000 | 0.016 | 0.050 | -0.056 | 0.076 | -0.009 | -0.087 |
| District Name | 0.097 | 1.000 | 0.254 | 0.253 | 0.467 | -0.362 | 0.125 | 0.756 | 0.398 | 0.563 | -0.193 | -0.261 | 0.025 | 0.360 | 0.372 | 0.025 | -0.071 | -0.066 | 0.363 | -0.638 | -0.220 |
| Original_Storage_Capacity | 0.143 | 0.254 | 1.000 | 0.947 | 0.014 | -0.266 | 0.000 | 0.000 | 0.187 | 0.000 | -0.220 | -0.164 | -0.006 | 0.003 | 0.000 | -0.006 | 0.296 | 0.023 | 0.049 | -0.031 | -0.829 |
| Present_Storage_Capacity | 0.126 | 0.253 | 0.947 | 1.000 | 0.014 | -0.245 | 0.000 | 0.000 | 0.187 | 0.000 | -0.167 | -0.087 | 0.004 | 0.003 | 0.000 | 0.004 | 0.321 | 0.037 | 0.049 | -0.011 | -0.695 |
| Reason_for_Water_Body_Use | 0.150 | 0.467 | 0.014 | 0.014 | 1.000 | -0.175 | 0.094 | 0.719 | 0.380 | 0.999 | 0.106 | -0.221 | 0.061 | 0.335 | 0.411 | 0.061 | -0.559 | -0.064 | 0.274 | -0.209 | 0.290 |
| Renovation_Year | -0.137 | -0.362 | -0.266 | -0.245 | -0.175 | 1.000 | 0.421 | 0.314 | 0.233 | 0.363 | 0.523 | 0.317 | 0.090 | 0.213 | 0.137 | 0.090 | 0.165 | -0.423 | 0.183 | 0.416 | 0.257 |
| Repair_Renovation_Status | 0.000 | 0.125 | 0.000 | 0.000 | 0.094 | 0.421 | 1.000 | 0.000 | 0.119 | 0.000 | -0.008 | -0.020 | 0.010 | 0.000 | 0.000 | 0.010 | 0.033 | -0.085 | 0.000 | 0.151 | 0.042 |
| Water_Body_Nature | 0.084 | 0.756 | 0.000 | 0.000 | 0.719 | 0.314 | 0.000 | 1.000 | 0.377 | 0.219 | NaN | NaN | -0.013 | 0.271 | 0.378 | -0.013 | 0.371 | 0.149 | 0.172 | 0.122 | -0.516 |
| Water_Body_Type | 0.104 | 0.398 | 0.187 | 0.187 | 0.380 | 0.233 | 0.119 | 0.377 | 1.000 | 0.150 | 0.079 | 0.360 | -0.043 | 0.107 | 0.049 | -0.043 | -0.032 | 0.047 | 0.181 | 0.083 | 0.477 |
| Water_body_in_use | 0.043 | 0.563 | 0.000 | 0.000 | 0.999 | 0.363 | 0.000 | 0.219 | 0.150 | 1.000 | -0.107 | 0.206 | -0.044 | 0.603 | 0.645 | -0.044 | 0.537 | -0.076 | 0.141 | 0.113 | -0.002 |
| construcion_year | -0.064 | -0.193 | -0.220 | -0.167 | 0.106 | 0.523 | -0.008 | NaN | 0.079 | -0.107 | 1.000 | 0.130 | 0.010 | 0.109 | 0.110 | 0.010 | -0.199 | 0.358 | 0.106 | 0.258 | 0.203 |
| construction_cost | 0.029 | -0.261 | -0.164 | -0.087 | -0.221 | 0.317 | -0.020 | NaN | 0.360 | 0.206 | 0.130 | 1.000 | -0.008 | 0.000 | 0.038 | -0.008 | 0.131 | 0.150 | 0.124 | 0.599 | 0.263 |
| df_index | 0.016 | 0.025 | -0.006 | 0.004 | 0.061 | 0.090 | 0.010 | -0.013 | -0.043 | -0.044 | 0.010 | -0.008 | 1.000 | 0.085 | 0.088 | 1.000 | -0.013 | 0.010 | 0.121 | 0.182 | 0.002 |
| filled_up_storage_name | 0.048 | 0.360 | 0.003 | 0.003 | 0.335 | 0.213 | 0.000 | 0.271 | 0.107 | 0.603 | 0.109 | 0.000 | 0.085 | 1.000 | 0.624 | 0.014 | -0.131 | -0.224 | 0.134 | -0.080 | -0.076 |
| filled_up_storage_space_name | 0.000 | 0.372 | 0.000 | 0.000 | 0.411 | 0.137 | 0.000 | 0.378 | 0.049 | 0.645 | 0.110 | 0.038 | 0.088 | 0.624 | 1.000 | -0.003 | -0.152 | -0.120 | 0.164 | -0.223 | 0.060 |
| level_0 | 0.016 | 0.025 | -0.006 | 0.004 | 0.061 | 0.090 | 0.010 | -0.013 | -0.043 | -0.044 | 0.010 | -0.008 | 1.000 | 0.014 | -0.003 | 1.000 | -0.013 | 0.010 | 0.121 | 0.182 | 0.002 |
| no_people_benefited_by_water_body | 0.050 | -0.071 | 0.296 | 0.321 | -0.559 | 0.165 | 0.033 | 0.371 | -0.032 | 0.537 | -0.199 | 0.131 | -0.013 | -0.131 | -0.152 | -0.013 | 1.000 | -0.031 | 0.107 | 0.125 | -0.232 |
| population_density_benefited | -0.056 | -0.066 | 0.023 | 0.037 | -0.064 | -0.423 | -0.085 | 0.149 | 0.047 | -0.076 | 0.358 | 0.150 | 0.010 | -0.224 | -0.120 | 0.010 | -0.031 | 1.000 | 0.049 | -0.514 | 0.020 |
| reason_water_body_in_use_name2 | 0.076 | 0.363 | 0.049 | 0.049 | 0.274 | 0.183 | 0.000 | 0.172 | 0.181 | 0.141 | 0.106 | 0.124 | 0.121 | 0.134 | 0.164 | 0.121 | 0.107 | 0.049 | 1.000 | 0.230 | 0.062 |
| renovation_cost | -0.009 | -0.638 | -0.031 | -0.011 | -0.209 | 0.416 | 0.151 | 0.122 | 0.083 | 0.113 | 0.258 | 0.599 | 0.182 | -0.080 | -0.223 | 0.182 | 0.125 | -0.514 | 0.230 | 1.000 | 0.196 |
| storage_capacity_change | -0.087 | -0.220 | -0.829 | -0.695 | 0.290 | 0.257 | 0.042 | -0.516 | 0.477 | -0.002 | 0.203 | 0.263 | 0.002 | -0.076 | 0.060 | 0.002 | -0.232 | 0.020 | 0.062 | 0.196 | 1.000 |
| | level_0 | df_index | Area_Type | State Name | District Name | Water_Body_Type | Water_body_in_use | Reason_for_Water_Body_Use | Water_Body_Nature | construcion_year | construction_cost | Renovation_Year | renovation_cost | Repair_Renovation_Status | Original_Storage_Capacity | Present_Storage_Capacity | filled_up_storage_name | filled_up_storage_space_name | no_people_benefited_by_water_body | reason_water_body_in_use_name2 | storage_capacity_change | population_density_benefited |
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| 0 | 0 | 0 | Rural | UTTARAKHAND | UDHAM SINGH NAGAR | Ponds | 0 | other | Natural | NaN | NaN | NaN | NaN | 0 | 23481 | 15654 | Nil/Negligible filled up | Rarely filled up | 4.0 | NaN | -7827 | 1.125101 |
| 1 | 1 | 1 | Rural | UTTARAKHAND | UDHAM SINGH NAGAR | Ponds | 0 | other | Man-made | 2019.0 | 80000.0 | NaN | NaN | 0 | 632 | 432 | Nil/Negligible filled up | Rarely filled up | 4.0 | NaN | -200 | 1.125101 |
| 2 | 2 | 2 | Rural | UTTARAKHAND | UDHAM SINGH NAGAR | Ponds | 0 | other | Man-made | 2017.0 | 70000.0 | NaN | NaN | 0 | 4600 | 4500 | Upto 1/4 | Rarely filled up | 4.0 | NaN | -100 | 1.125101 |
| 3 | 3 | 3 | Rural | UTTARAKHAND | UDHAM SINGH NAGAR | Ponds | 0 | other | Man-made | 2017.0 | 50000.0 | NaN | NaN | 0 | 3000 | 2400 | Upto 1/2 | Rarely filled up | 2.0 | NaN | -600 | 1.125101 |
| 4 | 4 | 4 | Rural | UTTARAKHAND | HARIDWAR | Ponds | 1 | Ground water recharge | Natural | NaN | NaN | NaN | NaN | 0 | 990 | 540 | Nil/Negligible filled up | Never filled up | 2.0 | NaN | -450 | 1.125101 |
| 5 | 5 | 5 | Rural | UTTARAKHAND | HARIDWAR | Ponds | 1 | Ground water recharge | Natural | NaN | NaN | NaN | NaN | 0 | 675 | 240 | Nil/Negligible filled up | Never filled up | 2.0 | NaN | -435 | 1.125101 |
| 6 | 6 | 6 | Rural | UTTARAKHAND | UDHAM SINGH NAGAR | Ponds | 0 | other | Natural | NaN | NaN | NaN | NaN | 0 | 6675 | 2670 | Full | Usually filled up | 3.0 | NaN | -4005 | 1.125101 |
| 7 | 7 | 7 | Urban | UTTARAKHAND | HARIDWAR | Ponds | 1 | Ground water recharge | Natural | NaN | NaN | NaN | NaN | 0 | 14536 | 13000 | Upto 1/2 | Rarely filled up | 20.0 | NaN | -1536 | 1.125101 |
| 8 | 8 | 8 | Rural | UTTARAKHAND | UDHAM SINGH NAGAR | Ponds | 0 | other | Natural | NaN | NaN | NaN | NaN | 0 | 7614 | 2538 | Upto 1/2 | Rarely filled up | 1.0 | NaN | -5076 | 1.125101 |
| 9 | 9 | 9 | Rural | UTTARAKHAND | UDHAM SINGH NAGAR | Ponds | 0 | other | Natural | NaN | NaN | NaN | NaN | 0 | 2358 | 1179 | Upto 1/2 | Rarely filled up | 3.0 | NaN | -1179 | 1.125101 |
| | level_0 | df_index | Area_Type | State Name | District Name | Water_Body_Type | Water_body_in_use | Reason_for_Water_Body_Use | Water_Body_Nature | construcion_year | construction_cost | Renovation_Year | renovation_cost | Repair_Renovation_Status | Original_Storage_Capacity | Present_Storage_Capacity | filled_up_storage_name | filled_up_storage_space_name | no_people_benefited_by_water_body | reason_water_body_in_use_name2 | storage_capacity_change | population_density_benefited |
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| 3086 | 3086 | 3086 | Rural | UTTARAKHAND | DEHRADUN | Ponds | 0 | other | Man-made | 2000.0 | 150000.0 | NaN | NaN | 0 | 120000 | 100 | Nil/Negligible filled up | Never filled up | 2.0 | NaN | -119900 | 1.125101 |
| 3087 | 3087 | 3087 | Rural | UTTARAKHAND | DEHRADUN | Ponds | 1 | Irrigation | Man-made | 2001.0 | 50000.0 | NaN | NaN | 0 | 21000 | 21000 | Upto 3/4 | Filled up every year | 8.0 | NaN | 0 | 1.125101 |
| 3088 | 3088 | 3088 | Rural | UTTARAKHAND | NANITAL | Ponds | 1 | Pisciculture | Natural | NaN | NaN | 2016.0 | 8424.0 | 0 | 24 | 20 | Upto 3/4 | Usually filled up | 5.0 | Domestic/Drinking | -4 | 0.002374 |
| 3089 | 3089 | 3089 | Rural | UTTARAKHAND | HARIDWAR | Ponds | 0 | other | Natural | NaN | NaN | NaN | NaN | 0 | 2400 | 1000 | Nil/Negligible filled up | Never filled up | 2.0 | NaN | -1400 | 1.125101 |
| 3090 | 3090 | 3090 | Rural | UTTARAKHAND | HARIDWAR | Ponds | 0 | other | Natural | NaN | NaN | NaN | NaN | 0 | 1500 | 600 | Nil/Negligible filled up | Never filled up | 1.0 | NaN | -900 | 1.125101 |
| 3091 | 3091 | 3091 | Rural | UTTARAKHAND | HARIDWAR | Ponds | 1 | Ground water recharge | Natural | NaN | NaN | NaN | NaN | 0 | 600 | 300 | Full | Filled up every year | 4.0 | NaN | -300 | 1.125101 |
| 3092 | 3092 | 3092 | Rural | UTTARAKHAND | UDHAM SINGH NAGAR | Ponds | 1 | Pisciculture | Man-made | 2000.0 | 50000.0 | 2012.0 | 5000.0 | 0 | 5040 | 4000 | Upto 3/4 | Usually filled up | 15.0 | Ground water recharge | -1040 | 0.800000 |
| 3093 | 3093 | 3093 | Rural | UTTARAKHAND | UDHAM SINGH NAGAR | Ponds | 1 | Pisciculture | Man-made | 2000.0 | 30000.0 | 2012.0 | 3000.0 | 0 | 1782 | 1200 | Upto 3/4 | Usually filled up | 11.0 | Ground water recharge | -582 | 0.400000 |
| 3094 | 3094 | 3094 | Rural | UTTARAKHAND | UDHAM SINGH NAGAR | Ponds | 1 | Pisciculture | Man-made | 2000.0 | 100000.0 | 2012.0 | 10000.0 | 0 | 28569 | 21000 | Upto 3/4 | Usually filled up | 20.0 | Ground water recharge | -7569 | 2.100000 |
| 3095 | 3095 | 3095 | Rural | UTTARAKHAND | UDHAM SINGH NAGAR | Ponds | 1 | Pisciculture | Man-made | 2000.0 | 80000.0 | 2012.0 | 8000.0 | 0 | 15444 | 12000 | Upto 3/4 | Usually filled up | 16.0 | Ground water recharge | -3444 | 1.500000 |
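In every sample row shown above, storage_capacity_change equals Present_Storage_Capacity minus Original_Storage_Capacity (for example, 15654 - 23481 = -7827). The check below runs that subtraction over the first ten rows, with values copied from the table. That the column was derived this way for the whole dataset is an inference from these rows, not something the report states.

```python
# (Original_Storage_Capacity, Present_Storage_Capacity, storage_capacity_change)
# copied from the first ten sample rows.
rows = [
    (23481, 15654, -7827),
    (632, 432, -200),
    (4600, 4500, -100),
    (3000, 2400, -600),
    (990, 540, -450),
    (675, 240, -435),
    (6675, 2670, -4005),
    (14536, 13000, -1536),
    (7614, 2538, -5076),
    (2358, 1179, -1179),
]

# Collect any row where present - original differs from the reported change.
mismatches = [
    (original, present, change)
    for original, present, change in rows
    if present - original != change
]

print(f"{len(rows) - len(mismatches)} of {len(rows)} rows match")
```

If the relationship holds throughout, it would account for the strong correlations reported between storage_capacity_change and the two capacity columns (-0.829 and -0.695).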